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ABSTRACT 

Many recent studies have demonstrated that scaling arguments, such as the so-called 
hierarchical ansatz, are extremely useful in understanding the statistical properties of 
weak gravitational lensing. This is especially true on small angular scales (i.e. at high 
resolution), where the usual perturbative calculations of matter clustering no longer 
apply. We build on these studies in order to develop a complete picture of weak lens- 
ing at small smoothing angles. In particular, we study the full probability distribution 
function, bias and other multipoint statistics for the "hot spots" of the convergence 
field induced by weak lensing, and relate these to the statistics of overdense regions 
in underlying mass distribution. It is already known that weak lensing can constrain 
the background geometry of the Universe, but we further show that it can also pro- 
vide valuable information about the statistics of collapsed objects and the physics of 
collisionlcss clustering. Our results are particularly important for future observations 
which will, at least initially, focus on small smoothing angles. 

Key words: Cosmology: theory - large-scale structure of the Universe - Methods: 
analytical - Methods: statistical 



1 INTRODUCTION 

The images of high redshift galaxies can be distorted by the gravitational lensing of light passing through intervening density 
fluctuations. Even the relatively weak lensing produced by large-scale galaxy clustering can provide valuable information 
about the distribution of mass on cosmological scales, as recent feasibility studies have demonstrated (Bacon, Refregier & 
Ellis 2000; van Waerbeke et al. 2000). Gravitational lensing studies offer a particular advantage over traditional studies of 
large-scale structure that rely on galaxies as tracers of mass. The analysis of galaxy catalogues can only provide us with 
information as to how galaxies are clustered, and galaxies need not be accurate tracers of the mass distribution, owing to the 
intervention of bias. To infer statistical properties of the underlying mass distribution from galaxy catalogues one needs to 
understand fully the relationship between mass and light, which is far from trivial to unravel. Weak lensing is produced by 
the mass distribution, and can therefore be used to probe the underlying mass distribution directly without having to allow 
for bias (Mellier 1999; Bernardeau 1999; Bartelmann & Schneider 1999). 

After original suggestions by Gunn (1967), pioneering contributions on weak lensing were made by Blandford et al. 
(1991), Miralda-Escude (1991), and Kaiser (1992). More recent developments basically follow two strands. Some authors have 
focussed on the case where large smoothing angles are used. In this regime highly non-linear effects are washed out, allowing 
simpler linear and quasi-linear analyses to be performed (Villumsen 1996; Stebbins 1996; Bernardeau et al. 1997; Kaiser 
1998). A perturbative analysis can not be used to study lensing at small angular scales because the perturbation series begins 
to diverge as this limit is approached. The other approach that has been taken has involved the development of numerical 
experiments, usually using ray-tracing methods through N-body simulations (Schneider & Weiss 1988; Jarosszn'ski et al. 
1990; Lee & Paczyn'ski 1990; Jarosszn'ski 1991; Babul & Lee 1991; Bartelmann & Schneider 1991; Blandford et al. 1991). 
Building on the earlier work of Wambsganns et al. (1995, 1997, 1998) the most detailed numerical study of lensing so far 
was performed by Wambsganns, Cen & Ostriker (1998). Other recent studies involving ray-tracing experiments have been 
conducted by Premadi, Martel & Matzner (1998), van Waerbeke, Bernardeau & Mellier (1998), Bartelmann et al. (1998) and 
Couchman, Barber & Thomas (1998). 



* e-mail: mdipak@mpa-garching.mpg.de 
T e-mail: Peter.Coles@nottingham.ac.uk 



© 0000 RAS 



A complete statistical analysis of weak lensing on small angular scales is not available at present, cheifiy because no 
corresponding analysis exists for the underlying density field. There are, however, several non-linear ansatze available. These 
predict a tree hierarchy for the matter correlation functions and are successful in some respects at modelling the results from 
numerical simulations of gravitational clustering (Davis & Peebles 1977; Peebles 1980; Fry 1984; Fry & Peebles 1978; Szapudi 
& Szalay 1993, 1997; Scoccimarro & Frieman 1998; Scoccimarro et al. 1998). The different ansatze involved in these studies 
all involve a tree hierarchy, but disagree with each other in the way they assign weights to trees of same order but different 
topology (Balian & Schaeffer 1989; Bernardeau & Schaeffer 1992; Szapudi & Szalay 1993; Boschan, Szapudi & Szalay 1994). 
The overall scaling in such models, determined by the time evolution of the two-point correlation function, is left arbitrary. 
However recent studies by several authors (Hamilton et al 1991, Nityananda & Padmanabhan 1994, Jain, Mo & White 1995; 
Padmanabhan et al. 1996; Peacock & Dodds 1996) have furnished an accurate fitting formula for the evolution of the two-point 
correlation function which can be used in combination with the hierarchical ansatze to predict all clustering properties of the 
dark matter distribution in the universe. 

The statistical treatments of lensing that have been carried out so far have mainly concentrated on the properties of low- 
order cumulants (van Waerbeke, Bernardeau & Mellier 1998; Schneider et al. 1998; Hui 1999; Munshi & Coles 2000; Munshi 
& Jain 1999a), cumulant correlators (Munshi & Jain 1999b) and errors associated with their measurement from observational 
data (Reblinsky et al. 1999; Schneider et al. 1998). However it is well known that higher and higher order moments are more 
and more sensitive to the tail of the distribution from which they are derived represent and are also more sensitive to errors 
due to finite catalogue size (Colombi et al. 1995; Colombi et al. 1996; Szapudi & Colombi 1996; Hui & Gaztanaga 1998). 
On the other hand, numerical simulations involving ray tracing techniques have demonstrated that the distribution of lensed 
fluctuations can be of considerable assistance in the estimation of cosmological parameters from observational data (Jain & 
Seljak 1997; Jain, Seljak & White 1999; Jain & Van Waerbeke 1999). The probability distribution function associated with 
density field is not sensitive to cosmological parameters in the way its weakly-lensed counterpart is. 

Munshi & Jain (1999a, b) and Munshi (2000) have extended these studies to show that the hierarchical ansatz can actually 
be used to make concrete analytical predictions for all statistical properties of the convergence that results from weak lensing. 
Valageas (1999a) has also used the hierarchical ansatz to compute the error involved in estimating the cosmological parameters 
fio and Ao from supernova observations; see alo Wang (1999). In Valageas (1999b) the effect of smoothing was incorporated 
successfully to compute the PDF of the convergence field. 

In this paper we extend these analyses still further to develop a complete picture of statistical properties of the sky- 
projected density field obtained by weak-lensing surveys. In particular, we obtain an analytic prediction for probability 
distribution function (PDF) , bias and higher-order moments for regions where the projected density is particularly high. We 
also show how these the statistics of these 'hotspots' are related to similar quantities in the underlying mass distribution. 
These results can be used to understand the effect of changing source redshift, smoothing angle and cosmological parameters 
such as Q.0 and Ao. This will allow a more detailed exploration of cosmological parameter space than is possible with ray- 
tracing experiments. Several of the analytic results we discuss here have already been tested against high resolution numerical 
simulations and found to be in very good agreement. Finally, we will also show that our present understanding of gravitational 
clustering is sufficient to make firm predictions that can be tested using future observations. 

Throughout this analysis we will be neglecting noise due to source ellipticity and will also be neglecting source clustering. 
A complete error analysis of different statistics with these contributions taken into account will be presented elsewhere (Munshi 
& Coles 2000b). 

The paper is organized as follows. In Section 2 we present a detailed analysis of the PDF for smoothed convergence field 
k 3 . In Section 2 we present an analysis of the bias associated with peaks in smoothed convergence field k 3 . Sections 4-6 are 
devoted to the study of third-, fourth-, and fifth-order statistics (respectively) of the smoothed convergence field. In Section 7 
we generalize these results to arbitrary order and in section 8 we comment about our results in general cosmological setting. 



2 THE DENSITY PDF FROM CONVERGENCE MAPS 

Our formalism is based on a hierarchical ansatz for the matter correlation functions. In principle the entire set of A-point 
correlation functions must be computed by solving a coupled series of non-linear integro-differential equations known as 
BBGKY hierarchy. Unfortunately, no such solution exists at present. On the other hand, the Vlasov-Poisson system in the 
fluid limit does admit hierarchical scaling solutions. Motivated by this fact, several well-known hierarchical ansatze have been 
suggested which are very helpful in understanding the non-linear dynamics of gravitational clustering. One of these ansatz is 
particularly interesting, namely the one proposed by Balian & Schaeffer (1989) and developed considerably by Bernardeau 
& Schaeffer (1992). These studies are based on multi-point cell count statistics and their scaling properties. To apply it to 
lensing statistics is reasonably straightforward in principle, but requires some preliminaries. 
We adopt the following line element to describe the background geometry: 

dr 2 = -c 2 dt 2 + a 2 {t){d X 2 + r 2 ( X )d 2 n), (1) 

where a(t) is the expansion factor. The angular diameter distance r(x) = A -1 / 2 sm[K~ 1 / 2 x] for positive spatial curvature, 
r(x) = (— A) -1 ^ 2 sinh[(— A) _1 ^ 2 x] for negative curvature, and r(x) = X f° r a na t universe. In terms of Ho and Qo, A = 

(no - i)#o 2 - 

We shall focus on the properties of the convergence ^(7) produced by lensing. In the standard terminology of gravitational 



lensing this is simply the ratio of the surface mass density £(7) (observed in a direction 7 to the critical surface density for 
lensing E cr . In our background geometry this reduces to a weighted integral of the density fluctuation <J(x) along the line of 
sight: 

«(7) = / X d X u(x)6[r( X )n}- (2) 
Jo 

If all the sources are at the same redshift, one can express the weight function ui(x) = 3/4ac~ 2 H 2 fl m r(x) r (Xs ~ X)/ r {Xs)i 
where X s is the comoving radial distance to the source. (This approximation is not essential to the calculation, and it is easy 
to modify what follows for a more accurate description.) Using the definitions we have introduced above we can compute the 
projected variance of k in terms of the power spectrum of density fluctuations P(l) using 

(Limber 1954). The higher order moments of the convergence field can be written in the form: 

^) = S N J^X-^^ f t\ (4) 

(Munshi & Coles 2000a), where the Sn are defined by analogy with the hierarchical parameters usually used in the context 
of galaxy counts-in-cells. 

If the mean number of galaxies per cell is N and the volume-average of the two-point correlation function over the cell is 

Ia = ^ / /&(ri,r 2 )<ftW 2 , (5) 
then the generalisation of equation the previous equation 

Cat = yjf J ... J £w(ri . . . r N )dtt ...dV N . (6) 

leads to the description of higher-order statistical properties of galaxy counts in terms of the scaling parameters Sn constructed 
from the £jv via 

Sn = -p^-. (7) 

S2 

In equation (4) we have also introduced a new notation, namely that 
d 2 \ „ f I \ „, a . 



K9o = J WY P W) ) W {Wo) > (8) 

in order to take account of smoothing over an angular scale do- The function W(Wo) is the window function for the smoothing. 
In many studies a top-hat filter is used for smoothing the convergence field, but our study can be extended to compensated 
filters (Schneider et al. 1998; Reblinsky et al. 1999), which may be more appropriate for observational purposes. 

In order to compute the PDF of the smoothed convergence field k 3 , we will begin by constructing its associated cumulant 
generating function $ 1+f '(j/) 

Using the expression for the higher moments of the convergence field k s we can write 

$ iy) = y + J E-ivr ffw ^=i5w< 

We can now express $ in terms of the cumulant generating function of the matter distribution, 4>(y) using 

00 

in which the constants S p are theO hierarchical parameters discussed above. In terms of ^(y) we get 

,u{x) Ko„ 



* 1+K (y)= r r 2 ( X )dx 



r 2 (x) («1> 



V / w( X )dX- (12) 



The second term in eq. ([li]) comes from the N — 1 term in the expansion of $ 1+K . Note that we have used the fully non-linear 
generating function (j> v for the cumulants, though we will use it to construct a generating function in the quasi-linear regime. 
The analysis simplifies if we define a new reduced convergence field 77,, defined by 



where the minimum value of k s , denoted K m , occurs when the line of sight goes through regions that are completely devoid 
of matter (i.e. 8 = — 1 all along the line of sight): 

k„, - - ,' dxw(x). (14) 
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Although k(9q) depends on the smoothing angle, its minimum value n m depends only on the source redshift and back- 
ground geometry of the universe and is independent of the smoothing radius. In terms of the reduced convergence rj a , the 
cumulant generating function is given by, 



(x)dx 



K0 n 



r 2 (x) (k!> 



(15) 



The new cumulant generating function § v (y) satisfies the normalization constraints Si = & = 1. 

The scaling function associated with P v (r)) can now be related to the matter scaling function h(x) introduced in Munshi 
et al. (1999a) in the context of matter clustering. This function is defined in terms of the scaling variable x = N/N c , where 
N c = In the hierarchical ansatz the probability distribution P(N) can be expressed in scaled form as 



P(N) = 



■h(x) 



The quantity h(x) can then be expressed in terms of an integral of the form 



h(x) = - ; 



dy ya(y)exp(yx). 



2m , 

>J ICO 

This can be extended to the bias factor of cells of occupancy N via 
1 



P(N)b(N) 



h(x)b(x) 



and so on for higher-order statistics; see Munshi et al. (1999a) for more details. 
In the present context we instead define 



H*(x) = - f 

J — c 



2ni 



exp(xy)$ v (y). 



Using this definition we can write 



H*(x) = — 



r\x){4) 



{K 2 s )r 2 (x) x 



K e„ oj{x) «n 



(16) 



(17) 



(18) 



(19) 



(20) 



These equations apply to the most general case within the small angle approximation, but can be simplified considerably 
using further approximations. In the following we will assume that the contribution to the integrals involved in the expressions 
for x can be replaced by an average value coming from the maximum of lo(x), i.e. Xc (0 < Xc < Xs)- This idea leads to the 
following approximate expressions: 

|«m| ~ 7}XsU{Xc) 
< K s) ~ lxs^(Xc) 



d 2 k 



P(k)W 2 (kr( X c)e ) 



(21) 



Using these approximations we can write 

*"(v) = r(y) 

H"(x) = h"(x). 



(22) 



We thus find that the statistics of the smoothed underlying field and those of the reduced convergence r\ are exactly the same 
in this approximation. (The approximate functions and h v (x) do satisfy the correct normalization constraints, but we omit 
the details here.) 

Although it is possible to integrate the exact expressions of the scaling functions, there is some uncertainty involved in the 
actual determination of these functions and their associated parameters which must be inferred from numerical simulations. 
See Colombi et al. (1996), Munshi et al. (1999a) and Valageas et al. (1999) for a detailed description of the effect of finite 
volume corrections involved in their estimation. In numerical studies involving ray tracing simulations (Munshi & Jain 1999b) 
it has been found that the minimal hierarchical model of Bernardeau & Schaeffer (1992), which has only one free parameter 
to be determined from numerical simulations, can reproduce results from simulations very accurately. 

Another useful approach is to construct an Edgeworth expansion of the PDF, starting with the simpler form P(rj s )- This 
can then be used to construct a series for P(k 3 ). The Edgeworth expansion (Bernardeau & Kofman 1994) is meaningful when 



the variance is less than unity, a condition that guarantees a convergent series expansion in terms of Hermite polynomials 
H n (v), of order n and with v — r) s / U (£ Vs ) -The relevant series expansion is 



(23) 



This is usually applied to a quasi-linear analysis of matter clustering using perturbative expressions for the cumulants. 
However, the parameters needed in the expansion of P(rf) are from the highly non-linear regime. Although the variance 
is still smaller than unity, the parameters that characterize it emerge from the highly non-linear dynamics of the underlying 
dark matter distribution. 

The magnification /i can also be used instead of k according to the weak lensing relation, fi s = 1 + 2n s . Its minimum 
value can be related to K m defined earlier via /i m = 1 + 2/t m . Finally, the reduced convergence 7] and the magnification \i can 
be related by the following equation: 

V. = (24) 

(Valageas 1999). We can now express the relations connecting the probability distribution function for the smoothed conver- 
gence statistics k s , the reduced convergence r] 3 and the magnification fi s as, 

P k (k s ) = 2P»{n.) = P"(7?.) . 1 2 , = P v M^- r (25) 

\± f^m ) |^m| 

The formalism which we have developed for one-point statistics such as the PDF and the VPF can also be extended to 
compute the bias and higher order cumulants associated with spots in k maps above a certain threshold. The statistics of 
such spots can be associated with the statistics of over-dense regions in the underlying mass distribution representing the 
collapsed objects. A detailed analysis of these issues will be presented elsewhere (Munshi & Coles 2000b). 



3 BIAS OF COLLAPSED OBJECTS FROM CONVERGENCE MAPS 



In order To compute the bias associated with the peaks in the convergence field we must first develop an analytic expression for 
the joint generating function /3(yi , J/2 ) for the convergence field k s - For that we will use the usual definition for the two-point 
cumulant correlator C pq of the convergence field 

M7i) P M72)"> 



(k 2 s) 



2\ p+q -2 



{k s (7i)k s (7 2 )> 



(26) 



For a complete treatment of lower order moments of k s , see Munshi & Coles (2000a). Like its counterpart for the density 
field, the two-point generating function of the convergence field can be expressed as a product of two one-point generating 
functions: 



2/^(2/1,2/2) 



a 



c 



p\q\ 



pi 



(27) 



where the function r K will be used later. The factorization property of the generating function depends on the factorization 
property of the cumulant correlators: C^ q = C^C^. Such a factorization is possible when the correlation between two patches 
in the directions 71 and 72 is smaller than the variance on the scale of one patch. Using this and starting with 



00 

= T—- 



ylyl <M7i) P M72) 9 > 



p\q\ (k 2 s }p+i- 2 <k s (7i)k 3 (7 2 )) 



(28) 



and then inserting the integral expression for the cumulant correlators in the hierarchical ansatz we obtain 



2 /?K (j/l, J/2) 



cm 



^ p\q\ 



(kI)p+*-* < Ks ( 7 i)M72)> J dX r2(p+ 9 -i) K " 12 ^o 



p-\-q — 2 p q 

2/12/2- 



(29) 



This expression involves the definition 



K 6l2 



d 2 \ 



i 

r (x) 



W 2 (W )exp(iie 12 ) 



(30) 



It is possible to simplify this approximation further by grouping the summations over dummy variables p and q. This is 
useful to establish the factorization property of the two-point (joint) generating function for bias 2/9(2/1,2/2)- Note that 



2 13 K (2/1, 2/2) 



" /" 
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r (x)dx 



fte 12 



(k s (7i)M72)) 



-2 00 

p\q\ 

pi 



2/1 



2 (X) («1> 



2/2- 



2 (x) («2> 



(31) 



b(Tl) 
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b(Tl ) 



b(k) = b(ri ) / k 
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1/K 





b(k) = b(ri ) / k 



1/K 



Figure 1. Construction of bias function b(K s ) for the smoothed convergence field from the bias b(r] s ) associated with the smoothed 
reduced convergence field rj s . 



We can now decompose the double sum over the two indices into two separate sums over individual indices. Finally using the 
definition of the one-point generating function for the cumulant correlators we can write: 



2/3 K (2/i,2/2) 



(X)dx 



(k s (7i)M72)) 



k 0q 
<"2> 



2/1- 



w(x) 



2 (x) («I> 



y-2 



r 2 (x) 



(32) 



The above expression is quite general. It depends on the small-angle and large separation approximations, but is valid for 
any particular model for the tree-correlation hierarchy. However it can be seen that the projection effects encoded in the 
line-of-sight integration do not allow us to express the two-point generating function /3,, (2/1, 2/2) simply as a product of two 
one-point generating functions f3 v (y) as can be done in the case of the underlying density field. 

As in the case of the derivation of the probability distribution function for the smoothed convergence field k s , matters 
simplfy considerably if we use the reduced smoothed convergence field ij s - We have already shown that the PDF associated 
with r/ s and the underlying mass distribution are the same under certain approximations. An identical ressult result indeed 
holds good for higher order statistics, including the bias. The cosmological dependence of the statistics of k 3 field is encoded in 
k m and the choice of the new variable 77 renders its related statistics almost independent of background cosmology. Repeating 
the above analysis again for the r) s field now we can express the cumulant correlator generating function for the reduced 
convergence field r/s as follows: 

-2 



2^(2/1,2/2) 



1 f X \: 

[Km] 2 J Q 



(X)dx 



(k s (7i)k s (7 2 )) 



K0n 



2/1 Kr 



2 (x) («2> 



1/2 Kr 



> 2 (x) M> 



(33) 



Although the above expression is very accurate and describes an important relationship between the density field and the 
convergence field, it is difficult to use for any practical purpose. It is also important to note that scaling functions such as h(x) 
for the density probability distribution function and b(x) for the bias associated with over-dense objects are typically estimated 
from numerical simulations, especially in the highly non-linear regime. Such estimations are subject to many uncertainties, 
such as the finite size of the simulation box. It has been noted in earlier studies that such uncertainties lead to large errors 
in estimates of h(x). Estimates of the scaling function associated with the bias b(x) is even more complicated, owing to the 
fact that these quantities are even worse affected finite size of the catalogues. It is consequently not fruitful to integrate the 
exact integral expression we have derived above. Instead we will replace all line-of-sight integrals with approximate values. An 
exactly similar approximation was used by Munshi & Jain (1999a) to simplify the one-point probability distribution function 
for k s , and it was found to be in good agreement with numerical simulations. In this approximation 



1 2 

(M7OM72)) ~ 2 XsUJ ( Xc ) 



' rk ; P(k)W 2 (kr( Xc 



8 ) exp[ifcr(xc)6 l i2] 



(34) 



_(27T) 2 ' 

This gives us the leading order contributions to the integrals needed and we can also check that, to this order, we recover the 
factorization property of the generating function, i.e. 

2^(2/1,2/2) = T v (yi)T r '(y 2 ) = ri +5 (2/i)Ti +5 (2/ 2 ) = r(i/i)r(y 2 ). (35) 

It is also interesting to note that C^j = C^ tl n m p ^ 1 . 

In this level of approximation, the factorization property of the cumulant correlators means that the bias function b(x) 
associated with the peaks above a given threshold in the convergence field k s also obey a factorization property, just as is the 
case in the density field counterpart (e.g. Coles, Munshi & Melott 2000): 

b v (xi)h v (xi)^ (X2)h v (x 2 ) = b 1+s {x 1 )h 1+s (xi)b 1+s (x 2 )h 1+ s{x2), (36) 
in which we have used the relation between f3(y) and b(x) given above: 



b v (x)h v (x) 



1 

"2^i 



dyr(y)exp(xy). 



(37) 



We established a similar correspondence between the convergence field and density field in the case of one-point probability 
distribution function in the previous section. 




Figure 2. Construction of the bias function b(n B ) for the smoothed convergence field from the bias b(r] s ) associated with the smoothed 
reduced convergence field r? s . 



The differential bias, i.e. the bias associated with a particular overdensity, is much more difficult to measure from 
numerical simulations than its integral counterpart. From now on we therefore concentrate on the bias associated with peaks 
above certain threshold, which can be expressed in a similar form to that given above: 

V(>x)h"{>x) = ~ [ dy^-exp(xy). (38) 

<J — loo ° 

(Munshi et al. 1999a) . Although the bias b(x) associated with the convergence field and that related to the underlying density 
field are exactly equal, the variance associated with the density field is very much higher than that of the convergence field. 
Projection effects bring down the latter to less than unity, meaning that we have to use the integral definition of bias to 
recover it from its generating function (see eq.(^) and eq.(^)). 

We can now write down the full two-point probability distribution function for two correlated spots in terms of the 
convergence field n and its reduced version r\: 

p K (ki, K2)dnid,K2 = p K (ni)p K (K2) [1 + b K (K^^b" (K2)] dnidK2 

pi(Vi,m)dVid V 2 = p"(r ?1 )p' ? fe)[l + 6' 7 (r 7l )^ 2 &' ? fe)]dr ?1 d 772 . (39) 

Following from the analysis presented in the previous section, we note that p(k s ) = p{rj s )/k m and that ff 2 = ^ 2 / K m- Using 
these relations we can now write 

*"(«) = (40) 

This is one of the main results of this analysis and has already been shown by Munshi (2000) to be in very good agreement 
with numerical ray tracing simulations. 



4 THREE-POINT STATISTICS OF COLLAPSED OBJECTS FROM CONVERGENCE MAPS 

A non-linear ansatz for higher-order correlations can be used not only to compute the correlation hierarchy for the underlying 
mass distribution but also the multi-point statistics of overdense cells, which represent collapsed objects (Bernardeau & 
Schaeffer 1992, 1999; Munshi et al. 199a, b,c; Coles et al. 1999). Whereas such studies generally concentrate mainly on three- 
dimensional statistical properties our aim in this paper is to investigate how much one can learn from such ansatze about 
the projected density field obtained from weak lensing surveys. In earlier sections we showed that the bias associated with 
peaks in projected density field can be very accurately modelled. We will now extend these results using multi-point cumulant 
correlators to show that such an analysis can also be performed for the skewness of overdense objects in the projected density 
field. 

The bias of collapsed objects is connected with the two-point cumulant correlators of the underlying mass distribution. 
In a very similar fashion, the three-point cumulant correlators of underlying mass distribution can be related to the skewness 
of collapsed objects. The three-point cumulant correlators for projected mass distribution, smoothed with an angle 9q can be 
written as (Munshi & Coles 2000a) 

fXs p+q + r 

M7i) p «.(72) , « s (73) r >c = / d x " +r _ 1} [C>fl 12 C^ 3 aVC 9 + cycperm.]. (41) 

Jo r 

The three-point cumulant correlators for the projected density field can be defined by the following equation: 

M7i) P ^(72) 9 M73) r >c 



C 



/ ^(7i)^(72)) c (^(72)^(73)) c (^ 2 (7))? +9+r " 3 

P + q + r (p+9+r- 3) 

X r 2(p+q+r-l) %1 <K 3 ( 7l K(72)) c 911 <M72K(73))c rl («J {-j))^^^ ' 



In what follows we present only one representative term for each topology at each order; other terms can be obtained by 
cyclic permutation of indices. This simplifies the presentation because the result of the final analysis can also be achieved by 
simple cyclic permutations of indices. Note that, because of the line-of-sight integration we can not decompose the generating 
function as a product of three generating functions of single variables. 
Introducing the new variable rj as before we find that 
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The generating function for the convergence field k s can then be written as a summation over different indices: 
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Making a change of variables from convergence k s to reduced convergence r) s we can write 
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As we mentioned above, it is possible to integrate the above equation (which is derived using only small angle approx- 
imation). However, note that due to line-of-sight effects it is no longer possible to separate the cumulant correlators of the 
convergence field in the same way as can be done for the underlying mass distribution. Using the approximations for the 
expressions which we have already used to simplify the generating function for two-point cumulant correlators, i.e. 
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we can write 
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So at this level of approximation we again obtain a factorization property of the cumulant correlators for the reduced 
convergence field. Also notice that such an approximation would man that the third order reduced cumulant correlators 
for convergence field can be related to the corresponding quantity for the underlying mass distribution by 
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The three-point joint PDF for the convergence field and the reduced convergence field now can be expressed as 
p K (fti, K2, K 3 ) = p K {ki)p k (k 2 )p k (K3) [1 + & K (ki)Ci2& K (k2) + b K (ki )Ci2^2 (^2)^23^ («s) + cyc.pcrm.] 

P v (Vi,V2,V 3 ) = P v (m)p V (V2)p v (m) [I + ^(m^iVM + b^MM^M + cyc.perm.] (49) 

The generating function for the reduced cumulant correlator C^ n , i.e. (J^iv) an< i its associated scaling function u^(x) (see 
Munshi et al. 1999a,b) can be related by 
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where the scaling variable x has the same meaning as defined earlier. One can also derive a similar expression for cumulative 
v% (> x), i.e. v% (> x) beyond a certain threshold (see Munshi, Coles, Melott 1999a,b; Munshi, Melott & Coles 2000; Munshi 
et al. 1999; Munshi & Melott 1998 for details). Using the fact that p(/t) = p(rj)/K m ; b(/t) = b(r))/K m and £ k — £^/^m we can 
finally obtain 
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Since the scaling function v% (> x) encodes information concerning the skewness of collapsed objects beyond a certain threshold 



b (n) b (k) = b (ti) / k. 




b (T|) b (T|) b (K) = b (ri)/ k b (K) = b (T|) /k 



Figure 3. Generating the third order vertex v^(k) from its reduced convergence counterpart u^(t]). Notice that every link in the r\ 
diagram corresponds to an extra factor of l/zt^, in the k diagram and hence one of these 1/K m gets attached to one of the two vertices 
connected by such links. Such a simplification however is possible only after some approximations, as described in the text. 



or hot-spots in convergence maps it is interesting to notice how such a quantity is directly related to the three-point cumulant 
correlators of the background convergence field. We can write the skewness S3 (> n s ) of those spots in convergence maps 
which cross certain threshold 

s * (> Ks) = V(>^ = 3 b^r = 3 ( Vs) - ( } 

It is easy to notice that, independent of cosmology, spots which cross certain threshold rj s in reduced smoothed convergence 
will have exactly same skewness as the skewness of the overdense regions of the mass distribution 1 + S = ^ crossing the 
same threshold. This is one of the main results in our analysis, in next sections we will show that this results holds good to 
higher order. Indeed, it turns out that even the PDF of collapsed objects and "hot-spots" in reduced convergence maps will 
be exactly equal. 



5 FOUR-POINT STATISTICS OF COLLAPSED OBJECTS FROM CONVERGENCE MAPS 



Four-point cumulant correlators can be analyzed in an exactly similar manner as in the previous section, except that in this 
case there are two distinct topologies ( "snake" and "star" topologies) which make contributions to the correlations. The star 
topology possesses a completely new vertex of third order, the snake topology (although of same order) is made of lower order 
tree vertices. Therefore it also provides a unique consistency check of the lower-order diagrams. 

The most general four-point cumulant correlator of arbitrary order can be expressed as (Munshi, Melott & Coles 1999a): 
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We will use the same superscript ij to denote both the reduced convergence field and the underlying mass distribution. Both 
fields display similar statistical properties, at least to the level of approximation used here. Developing the four-point function 
further yields 
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The snake contribution to four-point cumulant correlator can now be written 
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Considering the generating function for the snake terms, we obtain the following expression: 
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Once again, it is not possible to factorize the generating function because of line-of-sight contributions. After changing variables 
from k to t/ we can write 
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Similarly, for terms with star topologies, we can write 
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Finally the generating function 4/?« tar (l/i, 1/2, 2/3, 2te) can be written as: 
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We can now change variables from n s to rj s and write: 
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The four-point joint probability distribution function can not be factorized as in the case of underlying mass distribution, but 
using the same approximation as we have deployed throughout we can simplify these terms and write 
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M3(2/i)Mi(2/2)Mi (2/3)Mi(l/4), 
which implies that, as in the lower-order reduced cumulant correlators, we can write 

c P V = («-) p+1 c; 111 . (62) 

The joint probability distribution function for the convergence field and the reduced convergence field now can be written 
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The generating function for reduced cumulant correlator C p n, i.e. fi2(y) and v\(x) can be related by 
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where the scaling variable x has the same meaning as defined earlier. One can also derive a similar expression for the cumulative 



v$(> x ), i- e - v s, (> x ) beyond a certain threshold; see Munshi et al. (1999a,b,c) for details. Using the fact that p(/t) = p(r))/K m ; 
6(k) = b(r/)/K m ; V2{k) = vi{r])lK m and £ K = we can finally obtain 

„ 3>) = (65 ) 

Since the scaling function vz{> x) encodes information on the kurtosis kurtosis of collapsed objects beyond certain threshold, 
or hot-spots in convergence maps, it is interesting to notice how such a quantity is directly related to the four-point cumulant 
correlators ("star" -diagrams) of the background convergence field. We can write the kurtosis Si(> n s ) of those spots in 
convergence maps which cross certain threshold 

SU> «.) = + 12#^g = 4-^-4 + 12^1. (66) 

y ' b K (> k s ) 3 b K (>K s ) 4 bv(> Va ) 3 b^(>r/ s ) 4 y ' 

As in the case of skewness, the kurtosis associated with points which cross certain threshold in convergence map will have 
precisely identical statistical properties as in the underlying density field. 



6 FIVE-POINT STATISTICS OF COLLPASED OBJECTS FROM CONVERGENCE MAPS 



The five-point cumulant correlators can be related to the fifth order moments of collapsed objects in a very similar manner 
to the preceding calculations, so we will simply quote the results of such an analysis in this section. The complication here is 
that, in addition to having star and snake topologies, there is an additional hybrid topology in this case. 

For star topologies, the appropriate integrals will involve five node functions. Four of these will be of the form v\ , and 
one V4- There will also be four links between these nodes denoting the correlations. Hence we can write 
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For a snake topology the relevant contributions will come from three generating functions of three-point reduced cumulant 
correlators C p n denoted as before by 2*2(2/), and two generating function of two-point reduced cumulant correlators C p i which 
we have denoted by fii(y): 
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For the hybrid topology the contributions will come from the generating function of four-point reduced cumulant corre- 
lators Cpin denoted as before by H2(y), three-point reduced cumulant correlator C p n and the rest of the nodes 2x2(2/) will be 
generating function for two-point cumulant correlators C v \. So finally we can write 
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Making a change of variables again from the convergence field k s to the reduced convergence field r) 3 we obtain the 
following expressions for the five-point generating functions which, again, cannot be factorised. For the star topology we have 
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Figure 4. Transformation of nodes representing bright spots in convergence maps for a change of variables from convergence k s to 
reduced convergence r/ B . However such a decomposition is possible only under certain simplifying assumption (see text for details). 
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For the snake topology the relevant expression is 
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and similarly for the hybrid topology: 
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Using the same approximations as we have used before for the lower-order cumulants, we can see that the generating 
function ^4(2/) and its associated scaling function f 4 (x) for the reduced convergence field are, once again, exactly the same as 
the one for underlying mass distribution. This leads to 
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As in the case of the fourth-order cumulant correlators, these results now can be used to express the five-point joint probability 
distribution function for both convergence and reduced convergence field. Clearly the new vertex ^4(2/) and associated scaling 
function v^ix) (or the analogous V4,{rj s ) or V4(k s )) can be used to express the fifth-order cumulant for collapsed objects: 
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Finally the fifth order one-point moment of the collapsed objects is 
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7 GENERALIZATION TO ARBITRARY ORDER STATISTICS 

Let us now put together what we have learned so far and explain what can be learned about correlations of arbitrary order. 
If we consider a diagram of arbitrary topology and of arbitrary order it will consist of different kinds of nodes. These nodes 
will be represented by associated scaling functions fi s (y) and act as generating functions for the reduced cumulant correlators 



of various order C v ...\. The term will also consist of products of the factors which represent correlation function associated 
with the links which join these nodes. Products of these terms when integrated along the line of sight direction will provide 
us the joint generating function of arbitary order for convergence field k s : 
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Changing the variables from convergence field k s to reduced convergence r) s we can write 
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As before, this is the most general expression and includes all other results which we have obtained so far. We want to 
emphasize that deriving this result we have not used any approximation other than that of small angles and these integrals 
can be computed numerically to obtain the joint multi-point PDF for convergence fields. 

Given a specific model for the underlying mass distribution we can now relate all the multi-point properties of the 
projected density field to be obtained from weak lensing surveys. However, as we indicated before, with some approximations 
it is possible to simplify these results still further. This gives us an interesting insight into the statistical nature of the 
underlying mass distribution: 
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These approximations have already been tested against the results of high resolution ray tracing experiments (Munshi & Jain 
1999a, b; Munshi 1999)which have confirmed that these simplifications indeed reproduce the results in a remarkably accurate 
way. 

The vertices associated with collapsed objects Hn(y), can be linked to generating functions for the vertices in the matter 
correlation hierarchy in a very interesting way. This has been done in an order-by-order expansion (Bernardeau & Schaeffer 
1992; Munshi, Coles & Melott 1999a) and also to arbitrary order in a non-perturbative approach (Bernardeau & Schaeffer 
1999). These techniques were developed to compute the joint multi-point statistics of collisionless clustering. Our study shows 
that the statistics of reduced convergence field are deeply linked to the statistics of underlying mass distribution. However, 
we should keep in mind that although the same functional form for v n {x) holds for both reduced convergence field and the 
underlying mass distribution the variance associated with them are very different. The variance for the mass distribution is 
very high on small length scales, while the variance of reduced convergence field is small. This means that while it is possible 
to compute asymptotic forms for the scaling functions (such as v n {x)) for very small variance, complete information about 
Vn{x) can only be obtained by a numerical integrations. This has already been done by Munshi & Jain (1999a), Munshi (2000) 
for the case of PDF and bias. 



8 DISCUSSION 

In this paper we have explored the statistics of regions where the lensing convergence exceeds some threshold value. Our 
main purpose in this is to demonstrate the deep connection between statistical descriptors of the convergence and those of 
the underlying mass. 

Ongoing weak lensing surveys with wide field CCD are likely to produce shear maps on areas of order 10 square degrees or 
less; existing feasibility studies cover relatively small angles (Bacon et al. 2000; van Waerbeke et al. 2000). In the future, areas 
larger than 10° x 10° are also feasible, e.g. from the MEGACAM camera on the Canada France Hawaii Telescope and the 
VLT Survey Telescope. Ongoing optical (SDSS; Stebbins et al. 1997) and radio (FIRST; Refregier et al. 1997) surveys can also 
provide useful imaging data for weak lensing surveys. Such surveys will provide maps of the projected density of the universe 
and thus will help us both to test different dark matter models and probe the background geometry of the universe. They 
will also provide a unique opportunity to test different ansatze for gravitational clustering in the highly non-linear regime in 
an unbiased way. Traditionally such studies have used galaxy surveys, with the disadvantage that galaxies are biased tracers 
of the underlying mass distribution. 

Most previous studies in weak lensing statistics used a perturbative formalism, which is applicable in the quasilincar 
regime and thus requires large smoothing angles. To reach the quasi-linear regime, survey regions must exceed areas of order 
10 square degrees. Since existing CCD cameras typically have diameters of 0.25° — 0.5°, the initial weak lensing surveys are 
likely to provide us statistical information on small smoothing angles, of order 10' and less. This makes the use of perturbative 
techniques a serious limitation, as the relevant physical length scales are in the highly nonlinear regime. In our earlier studies 
(Munshi & Coles 2000a) we have shown how lower order moments, such as cumulants and cumulant correlators, can also be 
extended to the case of weak lensing surveys. Although an important step towards understanding the physics of gravitational 
clustering and constraining the cosmological parameters, this work did not represent a complete picture of the entire PDF and 
multi-point generalisations of the cumulants and cumulant correlators. This was another aim of the present study. Extending 



our earlier results we have shown that indeed such results can be obtained once we assume a hierarchical model for the 
underlying mass distribution. 

Numerical comparison of the one point probability distribution function (PDF) and bias of the convergence field have 
already been carried out by Munshi & Jain (1999a,b). The success of analytical results presented here indicates that the PDF, 
bias and other higher order statistics for "hot-spots" in convergence maps for a desired cosmological model can be computed 
as a function of smoothing angle and redshift distribution. Thus physical effects, such as the magnification distribution for 
Type la supernovae, can be conveniently computed. Thus we have a complete analytical description, based on models for 
gravitational clustering, for the full set of statistics of interest for weak lensing - the one point pdf, two point correlations, 
and the hierarchy of higher order cumulants and cumulant correlators. This analytical description has powerful applications 
in making predictions for a variety of models and varying the smoothing angle and redshift distribution of source galaxies; 
numerical studies are far more limited in the parameter space that can be explored. 

The lower-order statistics of "hot-spots" in the convergence field have also already been studied by (Munshi & Jain 
1999a, b). They demonstrated that, given a threshold, it is possible to link the statistics of collapsed objects with the 'hot- 
spots' in convergence field. It was also established that both one-point objects (such as PDF) and two-point objects (such 
as bias) measured from ray tracing experiments reproduce the analytical predictions with very good accuracy. This strongly 
motivates this present study, which develops these ideas further. Clearly the hierarchical ansatz has not been tested against 
N-body simulation in its full generality. While we have a very good analytical models for one-point scaling functions (Munshi 
et al. 1999a,b) which have been tested against numerical simulations, corresponding studies for two-point quantities are still 
lacking. One expects that situation will improve with availability of larger N-body simulations. 

As we found that a suitable transformations of variables from convergence to reduced convergence can make the analysis 
very simple. Moreover we found that the reduced convergence has exactly the same statistics as the underlying mass distribu- 
tion under some simplifying assumptions. However, in general the generating functions do not obey a factorization property 
similar to their counterpart for underlying mass distribution due to a line of sight integration. In general the minimum value 

of Ht 1.6. fcrn 

plays an important role in the transformation. 

The dependence of convergence statistics on parameters of the cosmological background model enters through their effect 
on K m . This is why the statistics of r/ s are not sensitive to the background geometry or dynamics, but the statistics of 
convergence k are very sensitive to parameters such as £1 and A. This implies that it is possible to determine or constrain 
these parameters using weak lensing surveys. Numerical evaluation of this approach has already been done by Munshi & 
Jain (1999a,b) and Munshi (2000) who found very good agreement between analytical predictions and numerical simulations. 
While the statistics of the convergence field is very sensitive to the cosmological parameters, the Sjv parameters of bright spots 
which correspond to the collapsed objects in mass distribution are independent of such parameters and are approximately the 
same as those of collapsed objects; the bias of these bright-spots does depend on cosmplogical parameters. 

Our studies also indicate that several approximations involved in weak lensing studies are valid even in the highly 
non- linear regime of observational interest. While a weakly clustered dark matter distribution is expected to produce small 
deflections of photon trajectories, it is not clear if the effect of highly over-dense regions capable of producing large deflections 
can also be modeled using a hierarchical ansatz and the weak lensing approximation. Numerical studies have shown that this 
is indeed the case; the effect of density inhomogeneities on very small angular scales can be predicted with high accuracy using 
analytical approximations, which confirms for example that the Born approximation is valid in the highly nonlinear regime. 

The finite size of weak lensing catalogs will play an important role in the determination of cosmological parameters. It is 
therefore of interest to incorporate such effects in future analysis. Noise due to the intrinsic ellipticities of lensed galaxies will 
also need to be modeled. The present study is based on a top-hat window function which is easier to incorporate in analytical 
computations. It is possible to extend our study to other statistical estimators such as M ap , which use a compensated filter 
to smooth the shear field and may be more suitable for observational studies (Schneider et al 1997; Reblinsky et al 1999). 
Our method of computing cumulants and cumulant correlators or PDF and bias can also be generalized to the case of other 
window functions and we hope to present such results elsewhere. 

It is interesting to consider an alternative to the hierarchical ansatz for the distribution of dark matter in the highly 
nonlinear regime. The dark matter can be modelled as belonging to haloes with a mass function given by the Press-Schechter 
formalism, and spatial distribution modeled as in Mo & White (1996). When supplemented by a radial profile for the dark 
halos such a prescription can be used to compute the one-point cumulants of the convergence field. Reversing the argument, 
given the one point cumulants of the convergence field it is possible to estimate the statistics of dark haloes. 
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